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ABSTRACT Systems biology offers considerable promise in uncovering novel pathways by which viruses and other microbial 
pathogens interact with host signaling and expression networks to mediate disease severity. In this study, we have developed an 
unbiased modeling approach to identify new pathways and network connections mediating acute lung injury, using severe acute 
respiratory syndrome coronavirus (SARS-CoV) as a model pathogen. We utilized a time course of matched virologic, pathologi- 
cal, and transcriptomic data within a novel methodological framework that can detect pathway enrichment among key highly 
connected network genes. This unbiased approach produced a high-priority list of 4 genes in one pathway out of over 3,500 
genes that were differentially expressed following SARS-CoV infection. With these data, we predicted that the urokinase and 
other wound repair pathways would regulate lethal versus sublethal disease following SARS-CoV infection in mice. We validated 
the importance of the urokinase pathway for SARS-CoV disease severity using genetically defined knockout mice, proteomic 
correlates of pathway activation, and pathological disease severity. The results of these studies demonstrate that a fine balance 
exists between host coagulation and fibrinolysin pathways regulating pathological disease outcomes, including diffuse alveolar 
damage and acute lung injury, following infection with highly pathogenic respiratory viruses, such as SARS-CoV. 


IMPORTANCE Severe acute respiratory syndrome coronavirus (SARS-CoV) emerged in 2002 and 2003, and infected patients de- 
veloped an atypical pneumonia, acute lung injury (ALD), and acute respiratory distress syndrome (ARDS) leading to pulmonary 
fibrosis and death. We identified sets of differentially expressed genes that contribute to ALI and ARDS using lethal and sub- 
lethal SARS-CoV infection models. Mathematical prioritization of our gene sets identified the urokinase and extracellular matrix 
remodeling pathways as the most enriched pathways. By infecting Serpine1-knockout mice, we showed that the urokinase path- 
way had a significant effect on both lung pathology and overall SARS-CoV pathogenesis. These results demonstrate the effective 
use of unbiased modeling techniques for identification of high-priority host targets that regulate disease outcomes. Similar tran- 
scriptional signatures were noted in 1918 and 2009 H1N1 influenza virus-infected mice, suggesting a common, potentially treat- 
able mechanism in development of virus-induced ALI. 
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(€ evere acute respiratory syndrome coronavirus (SARS-CoV) I pneumocytes (5). Disease severity was heavily influenced by age 


= emerged in 2002 and 2003 in China after evolving from related 
virus species circulating in bats (1, 2). This novel group II corona- 
virus infected over 8,000 people worldwide, with a mortality rate 
of ~10% (3). In 2012, a novel human betacoronavirus from group 
2c named Middle East respiratory syndrome coronavirus (MERS- 
CoV) emerged in the Middle East. This virus causes severe pneu- 
monia and renal failure with an approximately 55% mortality in 
over 77 confirmed cases to date (4). SARS-CoV primarily infected 
type II pneumocytes, which are normally responsible for produc- 
tion of lung surfactants and function as the progenitor cells of type 
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and other comorbidities, as mortality rates were >50% in popu- 
lations over 65 years of age. 

Acute SARS-CoV infection resulted in denudation of airway 
epithelial cells, often accompanied by the accumulation of debris, 
which obstructed airway functions (6, 7). Progression to acute 
lung injury (ALI) and the more severe form, acute respiratory 
distress syndrome (ARDS), often involved an acute-phase diffuse 
alveolar damage (DAD), which is characterized by exudates and 
hyaline membranes in the lung alveoli (5). Biopsy and autopsy 
findings from SARS patients within the early stages of infection 
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revealed the presence of exudative-phase DAD along with in- 
creased numbers of macrophages in the lung (1, 6). Many patients 
also had lung hemorrhage, noncardiogenic pulmonary edema, 
and/or hyaline membrane formation in the alveolar spaces. Pa- 
tients with longer-term disease (>10 days postinfection [dpi]) 
manifested with proliferative- or organizing-phase DAD that en- 
compassed 25% to 100% of the lung (8). They also had signs of 
lung fibrosis in both alveolar and interstitial spaces. SARS survi- 
vors continue to show lingering effects of their illness with com- 
plications including reduced lung elasticity and function (9). 

ALI and the development of DAD may also occur with other 
respiratory virus infections, including influenza virus (H5N1 or 
H1N1-2009) and adult respiratory syncytial virus (RSV) infec- 
tion, especially in the elderly (10-12). Much like SARS-CoV infec- 
tion, disease progression is first associated with an exudative phase 
of DAD, which may progress into organizing-phase DAD (13). 
The presence of exudates in the lung, composed of fibrin and 
proteinaceous material, blocks normal gas exchange (14). With- 
out clearance of these exudates, lung pathology progresses toward 
fibrotic disease with collagen deposition and conversion of the 
exudates into fibrous tissue. Thus, ALI and ARDS represent a 
common but poorly studied host response to virus-induced severe 
lung disease. 

Correlation-based gene coexpression network inference ap- 
proaches are unbiased and powerful tools that can be used to 
analyze large multidimensional data sets, including expression 
microarray and proteomics data. Specifically, they capture com- 
plex relationships between host gene expression patterns and 
emergent correlative behaviors (15-17). These approaches posit 
that genes and protein products are organized into functional 
modules according to cellular processes and pathways. In this 
work, we use weighted gene correlation network analysis 
(WGCNA) methods to interpret the dynamic SARS-CoV- 
mediated transcriptional response. While WGCNA has previously 
been applied to cancer, mouse, and yeast genetics, we use the 
approach to enhance systems-based analysis of virus infection 
(18-20). Key to this approach is the ability to discover systemic 
gene expression patterns based on underlying correlation struc- 
tures, which is not biased by existing knowledge of pathways or 
interactions. WGCNA partitions gene expression into groups of 
transcripts called subnetwork modules with highly correlated be- 
haviors. It has been shown elsewhere that connectivity inferred 
through WGCNA is a strong predictor of related biological func- 
tion (21). A key feature of these subnetwork modules, or eigen- 
genes, is that the most highly connected hub genes within the 
module provide candidate mediators of disease (16, 17). We fur- 
ther refined these candidates by modifying the tool to identify 
enriched pathways associated with candidate lists, allowing us to 
combine connectivity and predicted network structure with bio- 
logical functions and known interactions within a hypothesis- 
testing framework. 

Pathogenesis in mouse-adapted-SARS-CoV-infected animals 
closely mimics the pathologies that were observed in human pa- 
tients (22), including age-dependent disease severity. To model 
system-wide behaviors following SARS-CoV infection, we per- 
formed a dose-response study that included biological sampling at 
multiple time points, transcriptional and proteomic systems biol- 
ogy data, and mathematical modeling algorithms to identify sig- 
naling networks associated with progression from severe to lethal 
disease outcomes. These data demonstrate the successful use of 
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highly refined modeling algorithms to identify and validate novel 
genes and pathways that play critical roles in SARS-CoV patho- 
genesis and the development of ALI following virus infection in 
the lung. 

Ina healthy lung, fibrin levels are controlled through the action 
of the enzyme plasmin and the urokinase pathway through extra- 
cellular matrix (ECM) remodeling (23). ECM remodeling also 
involves complex interactions between a number of metallopro- 
teinases and their respective regulatory protein networks (24). 
Disruption of the urokinase pathway is associated with fibrotic 
lung disease or lung hemorrhage, depending on highly directional 
signaling cascades. Tissue plasminogen activator (tPA or PLAT), 
negatively regulated by Serpinel, is given as an anticlotting agent 
to recent stroke patients to promote cleavage of plasminogen into 
plasmin and enhance breakdown of fibrin clots (25). Although 
ECM and wound healing pathway activation are known to be impor- 
tant in lung disease, these processes are largely unstudied in relation 
to in vivo respiratory virus pathogenesis. In this work, we demon- 
strate a critical role for the urokinase pathway in regulating severe 
end-stage lung disease outcomes following SARS-CoV infection. 


RESULTS 


SARS-CoV infection model. To develop an unbiased strategy to 
explore the molecular mechanisms regulating SARS-CoV patho- 
genesis and host responses, we studied virus replication kinetics, 
clinical disease severity, pathological changes, and variations in 
host transcriptomics along with targeted proteomics using a dose 
escalation study. The goal was to identify contrasting outcomes 
for modeling the role of host responses in disease. Groups of 20- 
week-old C56BL6/J (B6) mice were infected with 10? to 10° PFU of 
recombinant mouse-adapted SARS-CoV (MA15) or mock in- 
fected (phosphate-buffered saline [PBS]). Clinical changes were 
noted daily, and biological samples were collected at 0, 1, 2, 4, and 
7 days postinfection (dpi) to allow for profiling of the different 
phases of infection. Mice infected with the lowest dose, 10? PFU, 
lost little if any weight and showed few clinical signs of disease 
(Fig. 1A). Mice infected with 10? or 104 PFU experienced transient 
weight loss and showed minor clinical signs of disease. Infection 
with 105 PFU resulted in continuous weight loss with =30% 
weight loss by day 7; at late time points, these animals showed 
significant clinical disease, including hunched posture, ruffled fur, 
decreased locomotion, labored breathing, and death. 

Virus load in the lung was quantified by plaque assay and was 
reflective of both the initial infectious dose and the overall repli- 
cation kinetics. Mice infected with 10? PFU of MA15 had relatively 
low or undetectable titers (Fig. 1B). Infection with the three higher 
doses resulted in the highest titers at 1 or 2 dpi, an ~1-log drop in 
titer at 4 dpi, and a decrease of several logs by 7 dpi, regardless of 
changes in weight loss. Virus load was confirmed by quantitative 
PCR (qPCR) for both virus genome and specific viral genes (see 
Fig. S1A in the supplemental material; also data not shown). Con- 
sistent with a previous report (22), minimal to no detectable virus 
was found in the serum or other organs at 4 dpi (see Fig. S1B to D), 
and no detectable virus was present in these other organs or serum 
at day 7. 

Pathological findings. Lung pathology varied by both infec- 
tious dose and time but was not dependent on virus load. Full 
histology scoring is available in Table S1A in the supplemental 
material. At 1 dpi, infected mice showed few inflammatory cells 
and healthy conducting airways, alveoli, and vasculature. By day 2, 
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FIG 1 SARS MAI5 dose response. (A) Weight loss is shown as percent 
starting weight over the course of a 7-day infection in 20-week-old B6 mice. 
Mice infected with 10? to 10* PFU of SARS-CoV MAI5 had low levels of 
transient weight loss, while mice infected with 10° PFU showed increasing 
weight loss over time. (B) Virus titer in the lung was quantitated by plaque 
assay. The mean value of all samples with detectable virus in each group is 
shown (three mice at 10? PFU and two each at 10%, 104, and 10° PFU had 
detectable virus by plaque assay at day 7; BLD, below the limit of detection of 
100 PFU per lung). 


mice in all infection groups showed damage to the large and small 
conducting airways of the lung in a dose-dependent manner, con- 
sisting of denuded patches of airway epithelial layers and apopto- 
tic debris, often obstructing the small airways. At 4 dpi, all infected 
animals had low levels of epithelial cell denudation and airway 
debris along with mild thickening of the interstitial membranes. 
Low-level perivascular cuffing was also present at this time point, 
and moderate, sporadic hemorrhage was consistently observed in 
mice infected with 10° to 10° PFU of MA15 at 4 dpi. 

By 7 dpi, lung disease was most pronounced in the parenchyma 
in mice infected with a 10°-PFU dose and included thickening of 
the interstitial septum, inflammatory infiltrates including neutro- 
phils, and pink proteinaceous exudates, typical of DAD (see 
Fig. S1E in the supplemental material). Importantly, these mice 
had clearly visible hyaline membranes, a characteristic of fatal 
SARS-CoV infection in humans (26). These mice also showed 
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severe and complete lung hemorrhage at 7 dpi, while mice infected 
with 104 PFU had moderate but notable hemorrhage (see Ta- 
ble S1A). Mice infected with lower doses had only mild to mod- 
erate vascular cuffing and interstitial inflammation along with 
sporadic and mild hemorrhage but were otherwise absent of se- 
vere lung disease. 

Global host expression patterns. Lung gene expression pat- 
terns were compared between mice with each infection dose and 
mock-infected mice to gain insight into SARS-CoV-induced lung 
disease. Consistent with earlier reports (27), there was minimal 
differential expression (DE) at the transcript level at 1 dpi (i-e., 
|log, fold change [FC]] of >1.5 and false discovery rate [FDR] 
adjusted P value of <0.05), regardless of infectious dose (Fig. 2A). 
However, the number of DE genes escalated with increasing dose 
or number of days postinfection. Overlap of DE genes was deter- 
mined between mice infected with 10* PFU and those infected 
with 10° PFU. The maximum percentage of overlap in DE genes 
was measured at 1 and 2 dpi, despite the low number of DE genes. 
The least overlap between these doses was measured at 7 dpi 
(Fig. 2B), with 23% of genes showing DE at 10° PFU but not at 
104 PFU. We hypothesized that these genes with different expres- 
sion patterns in mice infected with 10° PFU (here referred to as the 
lethal dose) from those in mice infected with 104 PFU (here re- 
ferred to as the sublethal dose) could provide insight into the 
molecular mechanisms and host expression patterns that regulate 
lethal versus sublethal disease following SARS-CoV infection. 

Eigengene network analyses following SARS-CoV infection. 
Gene expression patterns were compared between mice infected 
with the sublethal dose and lethal dose using the WGCNA ap- 
proach (28). In the consensus network analysis, 24 module eigen- 
genes (clusters of genes with similar expression patterns over 
time) were identified (23 with statistically significant connectivity; 
P value of <0.0001 using permutation test) (Fig. 3A). Many of the 
eigengenes represented groups of genes with similar expression 
patterns and dynamics between the lethal and sublethal doses. We 
chose to focus on one specific module eigengene that displayed 
distinct dose dynamics, hypothesizing that the differences in tran- 
script expression could explain the differential disease outcome. 

The eigengene that we pursued, shown in blue and circled in 
Fig. 3A, had significantly higher expression at 2, 4, and 7 dpi in 
lethally infected mice than in sublethally infected mice (Fig. 3B; 
expression patterns for the other statistically significant eigen- 
genes can be found in Fig. S2 in the supplemental material). This 
module was prioritizing based on differential expression between 
doses at day 7 based on overall upregulation in the lethal dose 
infection and differential module regulation when comparing 
doses. Specifically, A average log, fold change (FC) was calculated 
by subtracting the average log, FC for each dose. Using this strat- 
egy, we selected the blue module because it is strongly upregulated 
for the lethal dose (1.28 average log, FC) while the sublethal dos- 
ing was only mildly upregulated (0.36 average log, FC) and the 
resulting A average log, FC was 0.93 (an almost 2-fold difference). 
This eigengene module was comprised of 760 differentially ex- 
pressed transcripts. 

When the unbiased gene list from the blue module was ana- 
lyzed by the GeneGo Metacore knowledge base, the top functional 
categories were related to cell adhesion, ECM remodeling, and 
wound healing (see Table S2 in the supplemental material). The 
same knowledge base was used to visualize the highest-priority 
ECM remodeling signaling pathways containing proteins en- 
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FIG 2 Dose-response differential gene expression. (A) Differential expres- 
sion (DE) of transcripts for each dose is shown at each day postinfection. The 
number of DE transcripts was greatest for sublethal (104-PFU) and lethal (10°- 
PFU) infections at day 2, with 2,091 and 2,251, respectively. In total across all 
4 days, there were 3,138 unique DE transcripts for the 104-PFU infections and 
3,683 for the 10°-PFU infections. (B) The heat map shows the number of 
overlapping transcripts for each time point in both sublethal- and lethal-dose 
MAIS infections. Coloring represents the odds ratio or the effect size of each 
overlap, and gray numbers within the cells are the numbers of common dif- 
ferentially expressed (DE) transcripts. Analogous to differences in phenotype 
between infection doses, the overlap is strongest at day 2 and weakest at day 7 
postinfection. 


coded by the gene set with differential signaling in lethally or sub- 
lethally infected mice (see Fig. $3). To further refine our candi- 
dates, we assessed the connectivity and module membership 
within known biological pathways. The full statistical analysis 
workflow is described in Materials and Methods, and a diagram of 
the workflow is shown in Fig. $4. The urokinase pathway was the 
most significantly enriched for both module membership (P value 
= 0.000793, odds ratio [OR] = 8.31) and transcriptional connec- 
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FIG 3 Eigengene analysis. The consensus network is represented as a den- 
drogram (A), and modules are shown as colors below. The blue module (cir- 
cled in red) displayed distinct behavior for each dose (10* and 105 PFU), indi- 
cating potential mediators of MA15 infection pathogenesis (B). The arrow in 
panel A indicates the approximate location of Serpinel and PLAT within the 
blue module. 


tivity (P value = 0.0126, 10,000 permutations), making it the ideal 
target for validation. While the eigengenes themselves were not 
priority ranked, the unbiased method that we used to prioritize 
targets within the blue eigengene model could easily be applied to 
other modules. 

The urokinase pathway contributes to the broader ECM re- 
modeling pathway shown in Fig. S3 in the supplemental material, 
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and its members, including Serpinel, Serpine2, PLAT, and 
PLAUR, along with other ECM remodeling genes, show signifi- 
cantly higher expression at 7 dpi in the lethal dose than in the 
sublethal dose (see Fig. S5A). While the urokinase pathway has not 
previously been associated with respiratory virus infection, its in- 
volvement in ischemic events and metastatic cancers has been well 
documented (29). Furthermore, fibrin turnover and alveolar co- 
agulopathy are known to significantly contribute to the severity of 
ALI following injury or chemical insult (30). Based on these fac- 
tors, we explored the impact of the urokinase pathway on SARS- 
CoV infection. 

Acute lung injury following acute severe respiratory infec- 
tion. Expanding on the identification of the urokinase pathway, 
we examined whether conserved RNA expression signatures indi- 
cated development of a procoagulant or profibrinolytic state after 
SARS-CoV infection. Tissue factor, factor VIa, and Serpinel are 
major factors responsible for the development of a procoagulant 
and antifibrinolytic state in the alveoli (31, 32). Changes in alveo- 
lar hemostatic balance and intra-alveolar fibrin deposition are also 
stimulated by cytokine expression after ALI. Proinflammatory 
(interleukin-18 [IL-1], tumor necrosis factor alpha [TNF-a], 
and IL-6) and profibrotic (transforming growth factor B [TGF-B], 
connective tissue growth factor [CTGF], and platelet-derived 
growth factor [PDGF]) cytokine transcripts were significantly el- 
evated, starting by 4 dpi and through the course of the lethal 
SARS-CoV MA15 infection (see Fig. S5B in the supplemental ma- 
terial). In addition, genes associated with the induction of a pro- 
coagulant state (thrombin, factor VIla, factor Xia, factor XIla, 
PLAU, PLAT, tissue factor F2r) (32) and other fibrinolysin path- 
way components were altered by infection. Both pro- and antifi- 
brinolytic genes in the urokinase pathway showed increased ex- 
pression in our SARS-CoV infection model, demonstrating the 
complicated nature of this signaling cascade. Previous reports 
show that it is rare for one branch of the urokinase pathway to 
have altered regulation without host feedback loops adjusting the 
expression of other pathway genes (33). In general, the transcripts 
shown in Fig. S5B represent a lung with high levels of inflamma- 
tory and fibrinolytic gene expression, and this transcriptional pic- 
ture is supported by the hemorrhage observed in the lungs of 
SARS-CoV-infected mice. 

To increase our understanding of the changes in the lung fol- 
lowing acute viral infection, we examined microarray data from 
previously published respiratory infection models of ARDS. Tran- 
scriptional data from the lungs of 1918 influenza virus-infected 
BALB/c mice showed similar patterns of differential expression in 
the genes responsible for coagulation, surfactant production, and 
fibrinolysis (see Fig. S6 in the supplemental material). Further- 
more, B6 mice infected with the 2009 H1N1 influenza virus strain 
CA04 had high levels of proinflammatory, profibrotic, and uroki- 
nase pathway gene expression relative to mock-infected mice 
along with decreased surfactant expression (doses of 10% to 
10° PFU [unpublished data]). Overall, these transcription signa- 
tures are consistent with reported viral pathological and pro- 
teomic findings, strongly supporting the development of altered 
alveolar hemostatic balance seen during the early exudative phase 
of DAD. 

Urokinase pathway in SARS-CoV pathogenesis. To further 
investigate the role of the urokinase and ECM remodeling path- 
ways, we analyzed lung proteomics. Proteomics analysis revealed 
that fibrin beta and gamma chains, factor VIII, and cytokeratins, 


July/August 2013 Volume 4 Issue 4 e00271-13 


Mechanisms of SARS-CoV-Induced Acute Lung Injury 


all major components of hyaline membranes (34, 35), have in- 
creased expression in the lungs following both lethal and sublethal 
SARS-CoV infection compared to mock infection (Fig. 4A). Sim- 
ilarly, surfactant proteins, which typically protect against hyaline 
membrane formation and lung injury (36), have reduced expres- 
sion in both the lethal and sublethal doses of SARS-CoV com- 
pared to uninfected controls. However, at late time points, several 
of these same proteins sharply diverge between the lethal and sub- 
lethal doses. Fibrin beta and gamma chain levels are increased in 
abundance relative to mock infection only at day 7 in the lethal 
infection. In contrast, fibrin chains continually decrease after 2 dpi 
in sublethal infection. A similar pattern is observed with keratin 7, 
with increased protein abundance observed in the lethal dose. To 
further confirm these findings, we used a Martius scarlet blue 
(MSB) stain to demonstrate the presence of fibrin deposition in 
the lungs at day 7. Positive staining was observed in the lungs of 
lethally infected mice but not in the lungs of mock-infected or 
sublethally infected animals (Fig. 4B). Together, the proteomics 
data are consistent with pathophysiological changes resulting in 
increased fibrin deposition and hyaline membrane formation in 
the lungs of lethally infected mice. 

Fibrin is cleaved and degraded by plasmin, having been con- 
verted from plasminogen by urokinase pathway members (23). 
Analysis of plasminogen peptides revealed a distinct and signifi- 
cant increase following a lethal dose of SARS-CoV (Fig. 4A). In 
contrast, the sublethal infection resulted in an initial augmenta- 
tion but then maintained a low level of plasminogen peptides 
throughout the course of infection. It is important to note that 
plasminogen and plasmin are indistinguishable using these pro- 
teomics methods. Fibronectin, another component downstream 
of plasmin activation, also demonstrated increased protein ex- 
pression in the lethal dose compared to the sublethal dose at day 7. 
These data are consistent with augmented PLAU/urokinase activ- 
ity in the lethal dose despite maintaining similarly elevated levels 
of Serpinel, an inhibitor of the urokinase pathway. Finally, dis- 
eases associated with premature or excessive breakdown of fibrin 
clots are characterized by hemorrhage and the accumulation of 
fluid exudates in the alveoli. Consistent with the observation of 
vascular leakage into alveolar spaces and the development of 
DAD, levels of the serum protein albumin found in the lung were 
significantly elevated in lethally infected mice (Fig. 4A). 

Targeted knockout mice. To complete the systems biology 
loop (broad study, model/hypothesis generation, and then testing 
of the model), we selected a targeted member of the urokinase 
pathway for validation studies. Genes within the blue module 
were ranked according to their overall connectivity and agreement 
with the module eigengene expression, and Serpinel was the 
highest-ranked urokinase pathway member (correlation with the 
module eigengene [K,,,.] of 0.859). The approximate location of 
Serpinel within the blue module is highlighted in Fig. 3A. Ser- 
pinel~/~ mice infected with a sublethal dose of MA15 continued 
to lose weight through day 7 postinfection (Fig. 5A) and showed 
significantly more weight loss than did B6 controls at days 5 to 7 
postinfection (P value of <0.05). Additionally, one Serpinel~/~ 
mouse died, suggesting a necessary and protective role for Ser- 
pinel in modulating SARS-CoV pathogenesis. Clinical disease in 
Serpinel~/~ mice mirrored that in lethally infected B6 animals 
(Fig. 1A), with severely decreased locomotion, hunched posture, 
and labored breathing at late time points. Despite differences in 
clinical symptoms, virus load in the lung showed no significant 
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FIG 4 (A) Identification of urokinase and tissue remodeling pathway members. (A) Peptide levels from total lung homogenates were analyzed to determine 
expression of select ECM and urokinase pathway proteins. Mock-infection values are shown by dashed lines, sublethal infection values are shown by gray lines, 
and lethal infection values are shown by black lines. Significance values: *, P< 0.05; **, P< 0.01; ***, P < 0.001; #, lethal dose significant at P < 0.05. VWF, von 
Willebrand factor. (B) Lung sections from 7-dpi lethally or sublethally infected mice or mock-infected mice were stained for the presence of fibrin using MSB 
(Martius scarlet blue). Yellow staining indicates red blood cells, blue staining indicates connective tissue, and red staining indicates fibrin. Arrows point to 


positive fibrin staining. 


differences between knockout and wild-type (WT) mice (Fig. 5B) 
at 4 or 7 dpi, demonstrating that Serpinel had no influence on 
virus replication dynamics. When B6 and Serpinel~/~ mice were 
infected with the lethal dose of SARS-CoV, the Serpinel~/~ mice 
succumbed to infection sooner than did the B6 controls (Fig. 5C) 
(P value of <0.01), further indicating a protective role for Ser- 
pinel and the urokinase pathway in SARS-CoV-induced disease. 
While 10° PFU is lethal in 20-week-old B6 mice, wild-type mice 
rarely succumb before 7 dpi. 

Examining lung sections, we observed cuffing of inflammatory 


6 mBio mbio.asm.org 


cells around the large airways and vasculature of all infected mice 
at 4 dpi (see Table S1B in the supplemental material), which in- 
creased by 7 dpi. At 4 dpi, there was mild interstitial membrane 
thickening and low numbers of inflammatory cells were observed 
in the alveolar spaces of all infected mice; by 7 dpi, disease in the 
parenchyma had developed to moderate levels. Gross hemorrhage 
levels of the lung were noted at the time of tissue harvest. Ser- 
pinel~/~ mice trended toward increased hemorrhage in their 
lung tissue compared to B6 controls at day 4, and by 7 dpi, the 
knockout mice exhibited significantly more lung hemorrhage 
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FIGS Serpinel~/~ mice are susceptible to SARS-CoV infection. (A) Serpinel~/~ mice lost more weight than did B6 control mice when infected with 104 PFU 
of SARS-CoV MA15 (P value of <0.05 for Serpinel versus B6 at days 5, 6, and 7 postinfection). (B) Serpinel~/~ mice succumbed to infection more rapidly than 
did B6 controls when infected with 10° PFU of MA15 (** = P value of <0.01). (C) Lung mean virus load was quantitated by plaque assay. There was no statistical 
difference in viral titers at 4 dpi; at 7 dpi, most mice had lung titers below the limit of detection (BLD, <100 PFU; two Serpinel~/~ mice and one B6 control mouse 
with detectable virus). Independent replicate experiments confirmed significant differences in weight loss but no difference in lung titer between Serpinel~/~ and 
B6 controls at both 4 and 7 dpi (data not shown). (D) Representative histology images from Serpinel~/~ or B6 mouse lungs at 7 days postinfection show that 
infected knockout mice had extensive hemorrhage after infection with MA15. Exudates are indicated by open arrows with dashed lines; hemorrhage is shown by 
filled arrows with solid lines. (E) Log, fold change ratio of ARDS-related gene expression from the lungs of SARS-CoV-infected Serpine1~/~ and Bé mice at 4 and 
7 dpi (log, fold change = mean log, FC [WT] — mean log, FC [knockout] ). Green indicates that expression in Serpinel-knockout mice is lower than that in B6 
mice, and red indicates that expression in Serpinel-knockout mice is higher than that in B6 mice. 
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(P value of <0.05 by Student’s f test) (Fig. 5D). Combined, these 
data confirm our model prediction that Serpinel and the uroki- 
nase pathway play an important role in the pathogenesis of SARS- 
CoV infection. 

Examination of gene expression in the lungs of Serpine1~/~ 
mice and Bé6 controls showed that SARS infection causes greater 
PLAT, IL-6, tissue factor (F3), and Serpine2 expression in the ab- 
sence of Serpinel (Fig. 5E). Serpine2 functions similarly to Ser- 
pinel in control of PLAT and PLAU activity. IL-1B, Tgfb1, and 
PLAU all had increased expression in Serpinel~/~ lungs at day 4 
relative to B6 controls but decreased expression at day 7, while 
Ptgs2 and Tnfhad decreased expression in the Serpine1~/~ mice at 
both time points. These data support the existing evidence of feed- 
back loops within the urokinase pathway as well as confirming the 
overall dysregulation of the urokinase pathway following SARS- 
CoV infection. Combined, these transcriptional changes indicate 
that SARS-CoV-infected Serpinel~/~ mice had increased fibrino- 
lytic activity in their lungs and a decreased inflammatory response 
relative to B6 controls. A simplified schematic of the urokinase 
pathway in the presence and absence of Serpinel is shown in 
Fig. 6A and B, illustrating how the lack of Serpinel could lead to 
increased PLAT and PLAU activity and increased hemorrhage and 
disease. 

To further confirm the importance of the urokinase pathway in 
SARS-CoV pathogenesis, we infected mice deficient in PLAT, one 
of the two plasmin-cleaving proteins that are activated by Ser- 
pinel. PLAT expression was also highly connected to that of the 
overall blue module with a K,,,. of 0.815. Mice deficient in PLAT 
should have a complicated phenotype because of the ability of 
PLAU to cleave fibrin and compensate in this system. Sublethally 
infected PLAT~/~ mice showed a slight, but not statistically sig- 
nificant (P = 0.06), acceleration in recovery of weight loss com- 
pared to B6 controls (see Fig. S7A in the supplemental material) 
and a significant increase in exudates in the lung (P < 0.05; see 
Fig. S7E). In contrast, PLAT~/~ mice infected with a lethal dose of 
MA15 showed notable early mortality compared to B6 controls, 
although those knockout mice that survived early infection went 
on to recover (see Fig. S7B and C). Lethally infected PLAT~/~ 
mice also trended toward less hemorrhage in their lungs than did 
B6 controls. B6 and PLAT mice infected with equal doses of MA15 
had similar virus loads in the lung at both 4 and 7 days postinfec- 
tion (see Fig. S7D), demonstrating that manipulation of the uroki- 
nase pathway did not influence virus replication dynamics but 
instead impacted the host response in regulating the development 
and resolution of ALI. 


DISCUSSION 

In this study, we highlight an unbiased modeling approach using 
systems biology to investigate viral pathogenesis and identify a 
novel host pathway involved in SARS-CoV disease progression. 
While wound healing and extracellular matrix remodeling path- 
ways have previously been associated with lung disease (32), this 
association had not extended into studies of acute respiratory vi- 
rus infection. Our data suggest that dysregulation of the urokinase 
pathway during SARS-CoV infection contributes to more severe 
lung pathology and that Serpine1 plays a protective role following 
infection. Similar changes in the urokinase, coagulation, and fi- 
brinolysin pathway expression signatures are noted following 
highly pathogenic SARS-CoV and influenza virus infections (see 
Fig. S5B and S6 in the supplemental material), arguing for a con- 
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FIG 6 Urokinase pathway model. (A) Representation of the unperturbed 
urokinase pathway signaling pathway. (B) Without the presence of Serpinel, 
an inhibitor of both PLAU/urokinase and PLAT/tPA, there is increased cleav- 
age of plasminogen into the active plasmin and thus increased breakdown of 
fibrin clots and hemorrhage compared to an unperturbed system. Red T 
shapes indicate inhibition, and blue arrows indicate activation. 


served role for these pathways in virus-induced end-stage lung 
diseases, like ALI and ARDS. 

The lack of well-defined statistical methods for organizing 
complex data sets into prioritized modules has hampered the 
power for systems-based discovery. While gene coexpression net- 
works provide one approach to capture the complex relationships 
between transcripts, the approach can also be integrated with 
other types of quantitative data, like infection outcomes, patho- 
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logical findings, and clinical disease severity, resulting in a clearer 
distinction between principal components (eigengenes) and bio- 
logical processes. An important goal of these types of integrative 
analyses is the deconstruction of large data sets into high-priority, 
constructive gene signatures that provide predictive power for 
downstream analysis and validation. However, these types of anal- 
yses have rarely moved beyond simulations produced from exist- 
ing data sets. 

We have modified a WGCNA approach to cluster similarly 
regulated genes into eigengene modules in order to compare and 
contrast module behavior as a function of virus dose and disease 
severity. We then developed strategies to prioritize targets for 
downstream analyses at the level of larger pathway analyses which 
were followed by analysis of specific genes. This approach was 
applied to our virus infection data sets containing over 3,500 DE 
genes. It resulted in the identification and characterization of four 
highly prioritized genes in the urokinase pathway whose differen- 
tial expression was significantly associated with increased disease 
severity. We note that there are a number of enrichment ap- 
proaches, such as gene set enrichment analysis (GSEA) (37), or 
hypergeometric test-based approaches, such as GOStat (38), 
which can identify genes behaving in coordinated ways to aid in 
interpreting gene candidates. However, these approaches are of- 
ten focused on differential expression in the context of functional 
annotation. With our approach, we focus on de novo-inferred 
connectivity and the distribution of the most highly connected 
module members within canonical pathways. This allows for a 
much more focused refinement of the candidate list and enables 
the generation of direct hypotheses for perturbation testing 
among the candidates. 

Our unbiased transcriptomics analysis approach identified 24 
eigengenes with different expression patterns (Fig. 3A). The eigen- 
gene network analysis and prioritization also identified several 
highly ranked innate immune signaling components, including 
MyD88, STAT1, and Ptgs2, as having high connectivity with the 
severity of SARS-CoV disease outcome. Earlier studies have inde- 
pendently demonstrated that MyD88- and STAT 1-knockout mice 
are highly susceptible to SARS-CoV pathogenesis and that block- 
ing prostaglandin function protects old mice from SARS-CoV 
pathogenesis (39-41). STAT1 signaling is known to affect cell cy- 
cle and apoptotic processes, and STAT1-knockout mice are par- 
ticularly sensitive to bleomycin-induced lung fibrosis with in- 
creased collagen levels and enhanced fibroblast proliferation (42). 
Together, these previous reports further confirm the efficacy of 
our modeling approach. 

To further validate our model, we analyzed genes from one 
module eigengene with connections to wound healing and ECM 
remodeling pathways, including the urokinase pathway, which 
displayed dose-dependent expression changes. Both histology and 
proteomic data demonstrated that urokinase activity increased in 
a dose-dependent manner following SARS-CoV infection. To 
complete a validation loop, we characterized disease outcomes in 
Serpinel~/~ and PLAT~‘~ mice. The Serpinel~/~ mice, deficient 
in a major protein in the urokinase pathway, were significantly 
more susceptible to SARS-CoV infection than were WT controls 
in terms of both weight loss and lung hemorrhage (Fig. 5A and D), 
while the PLAT~/~ mice showed a mixed pathogenic phenotype 
that suggested improved outcome but significant impairment of 
lung function (see Fig. S7A and E in the supplemental material). 
These data confirm predictions that the urokinase pathway and 
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ECM remodeling are important for regulating SARS-CoV patho- 
genic outcomes and demonstrate the delicate balance between 
development of hemorrhage and fibrosis following ALI. While 
previous studies of SARS-CoV pathogenesis showed the presence 
of fibrin in the lung following infection, the importance of Ser- 
pinel and the urokinase pathway in development of disease was 
unknown. Furthermore, the use of unbiased methods to identify 
the urokinase pathway as a regulator of SARS-CoV disease repre- 
sents an important advance in analysis of microarray data. 

ALI and its more severe form, ARDS, are devastating end-stage 
lung diseases that can arise following a variety of acute insults to 
the lung epithelium and occur frequently following infection with 
SARS-CoV and H5N1 and HIN1 influenza viruses in humans (11, 
43, 44). Despite significant advances in treatment options, the 
overall mortality rates remain substantial and range between 30 
and 60%, resulting in ~75,000 deaths in the United States and over a 
million deaths globally each year (45). While the underlying molec- 
ular mechanisms governing virus-induced ALI remain to be eluci- 
dated, our transcriptomics, proteomics, modeling, and validation 
approaches using targeted knockout mice have coalesced into a 
model of altered hemostatic balance defined by the expression of 
procoagulative and antifibrinolytic factors resulting in the induc- 
tion of an exudative phase of DAD after infection. The urokinase 
pathway regulates fibrinolytic and procoagulative responses de- 
signed to prevent vascular permeability and hemorrhage (23, 29). 

Fibrin levels dramatically increased following lethal SARS- 
CoV MA\15 infection (Fig. 4A and B). Excess fibrin was likely 
mediated by Serpinel-driven inhibition of urokinase and tissue 
type plasminogen activators (PLAU and PLAT) and by blockade 
of plasmin activity by @2-plasmin inhibitor, whose transcripts are 
elevated following SARS-CoV infection. We suggest that lethal 
SARS-CoV infection overwhelms the normally protective, profi- 
brinolytic signaling of the urokinase pathway, leading to overall 
dysregulation, including increased Serpinel expression, and se- 
vere lung disease. While fibrin is required for normal wound heal- 
ing, persistent and excessive intra-alveolar fibrin levels can con- 
tribute to acute inflammatory and chronic interstitial lung disease. 
Fibrin stimulates the production of profibrotic growth factors 
(32), and many profibrotic cytokine transcripts, like TGF-B, 
CTGF, and PDGF, are elevated following SARS-CoV or influenza 
virus infection in mice (see Fig. S5 and S6 in the supplemental 
material) (46, 47). Pulmonary surfactant protein and transcript 
signatures are also reduced following acute viral infection 
(Fig. 4A) (see also Fig. S5) (48), potentially leading to collapse or 
closure of alveoli and loss of lung compliance (changes in lung 
volumes) (49). In combination, a high-fibrin/low-surfactant 
intra-alveolar environment provides an ideal environment for fi- 
broblast adherence and growth, resulting in collagen deposition 
and development of lung fibrosis (50). Finally, fibrin and fibrin 
breakdown products increase vascular permeability, stimulate 
migration and proliferation of inflammatory cells, and promote 
recruitment of neutrophils to the lung (51, 52). Although we can- 
not absolutely ascribe this phenotype to elevation in the levels of 
fibrin and fibrin breakdown products, fluorescence-activated cell 
sorting (FACS) analysis shows significant increases in lung neu- 
trophil counts, as a function of increasing SARS-CoV dose (data 
not shown). More recent studies have also shown that Serpinel 
inhibits neutrophil apoptosis (53), suggesting that neutrophil re- 
cruitment and effector function likely contribute to more severe 
disease outcomes following SARS-CoV MA15 infection. Similar 


mBio mbio.asm.org 9 


Hio'wseoiqu Aq pausiqnd - SLO? ‘E} Jaquiajdas uo Howse olqu WOd4 papeojuMOq 


Gralinski et al. 


findings have been reported with highly pathogenic influenza vi- 
ruses, including H5N1 and 1918 influenza viruses (54). 

Fibrin accumulation in the lung is a hallmark of ALI and 
ARDS, and a reduced capacity to cleave and remove fibrin depos- 
its corresponds with a poor clinical patient outcome (31). Nota- 
bly, increased Serpinel levels were measured in the blood of 
SARS-CoV-infected patients during the 2002-2003 epidemic, and 
increased Serpinel expression has been measured in the lungs of 
SARS-CoV-infected macaques (55, 56). Necropsy findings from 
H5N1 influenza patients revealed accumulations of fibrin in the 
lung along with pulmonary edema and other signs of DAD and 
hemorrhage (57). Furthermore, 1918 and 2009 H1N1 influenza 
virus-infected mice showed increased levels of urokinase pathway 
transcripts (see Fig. S6 in the supplemental material; also data not 
shown). The host response to Streptococcus pneumoniae also in- 
cludes upregulation of Serpinel expression to protect against 
hemorrhage and ALI. The deubiquitinase CYLD provides critical 
control of Serpinel expression (58), and in the absence of CYLD, 
infected mice develop lung fibrosis. Experiments using the mouse 
model for pneumonic plague have shown that deletion of the bac- 
terial plasminogen activator, which cleaves plasminogen and also 
Serpinel, changes the course of disease and protects mice from 
lethal lung injury (59). Combined, these data demonstrate the 
critical balance of coagulative and fibrinolytic signaling in the lung 
following injury and suggest acommon modality for development 
of pathogen-induced ALI and ARDS after microbial infection. 

The systems biology approach promises to aid in our under- 
standing of complex biological processes through the high- 
throughput, unbiased analysis of broad data sets. Previous work 
has demonstrated the ability to identify distinct transcriptomic 
signatures predictive of the immune response to the yellow fever 
and influenza vaccines (60, 61). Our approach highlights the abil- 
ity to maintain statistical rigor and analytical prioritization that is 
both driven by and focused on biological relevance and experi- 
mental validation. Using this approach, mathematical models and 
their predictions, as well as the subsequent perturbations and re- 
finement, allowed us to characterize the molecular mechanisms 
that lead to the initiation and progression of infectious disease 
pathogenesis. A comparative approach with multiple viruses, such 
as SARS virus and highly pathogenic influenza virus, can identify 
common pathways that lead to severe end-stage lung disease and 
provide high-value targets for therapeutic approaches that address 
multiple pathogens. These data demonstrate the successful use of 
systems biology approaches to identify and then validate novel 
genes and pathways that play critical roles in SARS-CoV patho- 
genesis in the lung and suggest targeting urokinase and 
coagulopathy-related signaling pathways as a therapeutic ap- 
proach to treat virus-induced ALI. 


MATERIALS AND METHODS 


Viruses and cells. Recombinant mouse-adapted SARS-CoV (MA15) was 
propagated on Vero E6 cells, and its titer was determined. For virus titra- 
tion, half of the right lung was used to give PFU per lung using Vero E6 
cells with a detection limit of 100 PFU (62). All experiments were per- 
formed in a class II biological safety cabinet in a certified biosafety level 3 
laboratory containing redundant exhaust fans by workers wearing per- 
sonnel protective equipment, including Tyvek suits, hoods, and high- 
efficiency particulate air (HEPA)-filtered powered air-purifying respira- 
tors (PAPRs). 

Animals. C57BL/6J (stock no. 000664), Serpinel~/~ (stock no. 
002507), and PLAT~/~ (stock no. 002508) mice were obtained from the 
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Jackson Laboratory (Bar Harbor, ME). Mice were anesthetized with a 
mixture of ketamine and xylazine and intranasally infected with either 
50 pl of phosphate-buffered saline (PBS) alone or MA15 (e.g., 10? to 
10° PFU /animal). Animals were maintained in HEPA-filtered Sealsafe 
cages (Techniplast, Buguggiate, Italy). All animal housing and care were 
conducted in accordance with all University of North Carolina (UNC)- 
Chapel Hill Institutional Animal Care and Use Committee guidelines. 
Lung tissues from a total of 5 infected mice were harvested at each dose on 
days 1, 2, 4, and 7 postinfection in the dose-response study. Lung tissues 
from a total of 4 infected mice were harvested at days 4 and 7 postinfection 
in the knockout mouse studies. Mock-infected animals were age matched 
and harvested at each time point. Weight loss significance was determined 
by Student’s t test (Microsoft Excel), and significance in survival data was 
determined by the Mantel-Cox test (GraphPad). qPCR assays were used 
to confirm the infection status of all mice. All mouse studies were per- 
formed at the University of North Carolina (Animal Welfare Assurance 
#A3410-01) using protocols approved by the UNC Institutional Animal 
Care and Use Committee (IACUC). 

Histological analysis and hemorrhage. Gross hemorrhage of lung tis- 
sue was observed immediately after euthanasia and scored on a scale of 0 
(no hemorrhage in any lobe) to 4 (extreme and complete hemorrhage in 
all lobes of the lung). Lung tissues for histological analysis were fixed in 
10% formalin (Fisher) for at least 7 days, tissues were embedded in par- 
affin, and 5-ywm sections were prepared by the UNC histopathology core 
facility. To determine the extent of inflammation, sections were stained 
with hematoxylin and eosin (H&E) and scored in a blinded manner that is 
detailed in Text S1 in the supplemental material. Slides containing adja- 
cent sections of lung tissue were stained with Martius scarlet blue (MSB) 
to visualize fibrin by the UNC histology core. Images were captured using 
an Olympus BX41 microscope with an Olympus DP71 camera. 

RNA isolation, microarrays, quantitation of viral RNA species, full 
details of statistical analysis of gene expression data, and data dissemina- 
tion are all discussed in Text S1 in the supplemental material. 

SARS-CoV infection response network inference. To identify groups 
of host transcripts that showed coordinated regulation in response to 
infection with SARS-CoV, we applied WGCNA (21, 63). The WGCNA 
method detects signaling subnetworks or modules consisting of groups of 
genes that are highly connected according to a neighborhood proximity 
metric called the topological overlap (TO). TO quantifies the degree of 
shared network neighbors. Modules are represented as eigengenes by tak- 
ing the first principal component of each set of module transcripts, which 
describe most of the variance in the module gene expression. We have 
adapted WGCNA into a workflow with three key steps: (i) de novo net- 
work construction, (ii) consensus network analysis, and (iii) network en- 
richment (see Fig. $4 in the supplemental material). The de novo network 
construction and the derivation of signaling modules are described in 
detail in Text S1 in the supplemental material. In addition to analysis of 
the individual doses, we utilized a consensus approach to identify mod- 
ules present across both networks (see Fig. $4). Two nodes should be 
connected in a consensus network only if all of the input networks agree 
on that connection. We define the consensus network similarity between 
two nodes as the minimum of the input network similarities. This allows 
identification of regulatory differences between two networks, even when 
the resulting topology/modules are conserved. 

Novel prioritization within network modules. We prioritized the 
statistically validated host response modules to identify signaling events 
that differed between doses at day 7 based on overall upregulation of the 
105-PFU-dose treatment and differential module regulation when com- 
paring treatments. Average log, FCs were calculated across modules for 
each dose. Modules were labeled as upregulated at day 7 if the average log, 
FC was greater than zero. A A average log, FC was calculated by subtract- 
ing the average log, FC for each dose at day 7. Network enrichment was 
then performed in two stages: functional enrichment using gene ontology 
(see Text S1 in the supplemental material for details) and connectivity 
enrichment. To aid in the prioritization of the candidates to be validated 
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within the modules, we examined the connectivity distribution relative to 
known canonical pathway membership within each module. We first 
identified canonical pathways that intersected with a given WGCNA 
module and then took the average TO connectivity between intersecting 
members. Ten thousand random samplings of sets of genes of the same 
size were used to derive a null distribution and determine if a given path- 
way was enriched for TO connectivity. This provides information regard- 
ing possible biological relevance to our putative candidates that had been 
ranked within the module based on their similarity to the eigengene. 

Proteomics samples. For proteomics analysis, SARS-CoV- or mock- 
infected lung samples were collected and washed with 50 mM ammonium 
bicarbonate buffer, homogenized in 1 ml of 8 M urea in 50 mM ammo- 
nium bicarbonate buffer with glass beads for 60 s at 6,000 rpm, and then 
incubated for 1 h at room temperature. Samples were then centrifuged at 
5,000 rpm to remove debris and immediately stored at — 80°C. At each 
time point, analysis of individual mouse lungs provided intensity values 
for proteins derived from multiple peptides. The mean intensity (abun- 
dance) for each protein was then graphed as an average (n = 5 for infec- 
tion, n = 3 for mock infection) for each group at each time point. Missing 
or absent values were not scored; however, if no value was observed in any 
of the samples at a time point, the sample was registered with a single 0, 
representing “not detected.” Full details of the proteomics sample prepa- 
ration methods are in the supplemental material. 
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